Project Aletheia: Verifier-Guided Distillation of Backtracking for Small Language Models
arxiv.org·4h
Experiments on Reward Hacking Monitorability in Language Models
lesswrong.com·4h
Making a Language
thunderseethe.dev·11h
CodeSOD: Validation Trimmed Away
thedailywtf.com·1d
Randomization in Typst
idraluna-archives.bearblog.dev·14h
t2x - a CLI tool for AI-first text operations
shruggingface.com·1d
Use of Assertions
blog.regehr.org·18h
Building a Self-Healing Data Pipeline That Fixes Its Own Python Errors
towardsdatascience.com·20h
Dealing with alternatives
jemarch.net·1d
Loading...Loading more...